Probabilistic Planning with Risk-Sensitive Criterion

Author

  • Ping Hou
Abstract

Probabilistic planning models, in particular Markov Decision Processes (MDPs), Partially Observable Markov Decision Processes (POMDPs), and Decentralized Partially Observable Markov Decision Processes (Dec-POMDPs), have been used extensively by the AI and decision-theoretic communities for planning under uncertainty. Typically, solvers for these models find policies that minimize the expected cumulative cost (or, equivalently, maximize the expected cumulative reward). While such a policy is good in the expected case, there is a small chance that it results in an exorbitantly high cost. It is therefore unsuitable for high-stakes planning problems, where exorbitantly high costs should be avoided. With this motivation in mind, Yu, Lin, and Yan (1998) introduced the Risk-Sensitive criterion (RS-criterion) for MDPs, where the objective is to find a policy π that maximizes the probability Pr(cT(s0) ≤ θ0), where cT(s0) is the cumulative cost of the policy and θ0 is the cost threshold. They combined MDPs with the RS-criterion to formalize Risk-Sensitive MDPs (RS-MDPs) and introduced a Value Iteration (VI)-like algorithm to solve a certain class of RS-MDPs. Liu and Koenig (2006) generalized RS-MDPs by mapping the MDP rewards to risk-sensitive utility functions and sought policies that maximize the expected utility; an RS-MDP is the special case in which the utility function is a step function. They introduced Functional Value Iteration (FVI), which finds optimal policies for general utility functions by approximating them with piecewise linear (PWL) functions. Unfortunately, algorithms like VI and FVI do not scale to large problems, because in each iteration they must perform Bellman updates for all states and all break points of the utility function. More efficient algorithms can therefore be developed by exploiting the structure of RS-MDPs.
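To make the RS-criterion concrete, the following is a minimal Python sketch, not the algorithm of Yu, Lin, and Yan (1998) nor any of the algorithms discussed here. It computes the maximum of Pr(cT(s0) ≤ θ0) by recursion over pairs of (state, remaining budget). The toy MDP, the names `TOY_MDP`, `GOAL`, and `rs_value`, and the restriction to positive integer costs with no dead ends are all assumptions made for illustration.

```python
from functools import lru_cache

# Hypothetical toy MDP (not from the source): state -> action -> list of
# (probability, next_state, cost) outcomes. Costs are assumed to be positive
# integers, so every step shrinks the remaining budget and recursion ends.
TOY_MDP = {
    "s0": {
        "safe": [(1.0, "goal", 4)],
        "risky": [(0.8, "goal", 2), (0.2, "s1", 2)],
    },
    "s1": {
        "retry": [(0.5, "goal", 3), (0.5, "s1", 3)],
    },
}
GOAL = "goal"


def rs_value(mdp, goal, state, budget):
    """Return the maximum probability of reaching `goal` from `state`
    with cumulative cost at most `budget` (the RS-criterion objective)."""

    @lru_cache(maxsize=None)
    def value(s, theta):
        if theta < 0:
            return 0.0  # the cost threshold has already been exceeded
        if s == goal:
            return 1.0  # goal reached within the threshold
        # Bellman-style update on the augmented state (s, theta):
        # pick the action with the highest success probability.
        return max(
            sum(p * value(nxt, theta - cost) for p, nxt, cost in outcomes)
            for outcomes in mdp[s].values()
        )

    return value(state, budget)


if __name__ == "__main__":
    for theta0 in (3, 4):
        print(theta0, rs_value(TOY_MDP, GOAL, "s0", theta0))
```

In this toy instance the action "risky" has the lower expected cost (3.2 versus 4 for "safe"), so a risk-neutral solver would choose it. Under the RS-criterion with θ0 = 4, "safe" is preferred because it meets the threshold with probability 1.0 rather than 0.8, while with θ0 = 3 only "risky" can meet the threshold at all.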
In my work, I introduced various algorithms for RS-MDPs under different assumptions (e.g., MDPs with dead ends and MDPs with zero- or negative-cost cycles). In addition to RS-MDPs, POMDPs and Dec-POMDPs can also be combined with the RS-criterion to formalize Risk-Sensitive POMDPs (RS-POMDPs) and Risk-Sensitive Dec-POMDPs


Similar resources

Risk-Sensitive Planning with Probabilistic Decision Graphs

Probabilistic AI planning methods that minimize expected execution cost have a neutral attitude towards risk. We demonstrate how one can transform planning problems for risk-sensitive agents into equivalent ones for risk-neutral agents provided that exponential utility functions are used. The transformed planning problems can then be solved with these existing AI planning methods. To demonstrat...


Telecommunications Network Planning Method Based on Probabilistic Risk Assessment

Telecommunications networks have become an important social infrastructure, and their robustness is considered to be a matter of social significance. Conventional network planning methods are generally based on the maximum volume of ordinary traffic and only assume explicitly specified failure scenarios. Therefore, present networks have marginal survivability against multiple failures induced b...


The vanishing discount approach in Markov chains with risk-sensitive criteria

In this paper stochastic dynamic systems are studied, modeled by a countable state space Markov cost/reward chain, satisfying a Lyapunov-type stability condition. For an infinite planning horizon, risk-sensitive (exponential) discounted and average cost criteria are considered. The main contribution is the development of a vanishing discount approach to relate the discounted criterion problem w...


A New Multi-objective Model for Multi-mode Project Planning with Risk

The purpose of this problem is to choose a set of project activities for crashing, in a way that the expected project time, cost and risk are minimized and the expected quality is maximized. In this problem, each project activity can be performed with a specific executive mode. Each executive mode is characterized with four measures, namely the expected time, cost, quality and risk. In this pap...


Wind Integrated Bulk Electric System Planning

The utilization of the wind to generate electrical energy is increasing rapidly throughout the world. By the end of 2009, the worldwide installed wind capacity reached 159,213 MW (World Wind Energy Report 2009). Wind turbine generators can be added and are being added in large grid connected electric power systems. Wind power, however, behaves quite differently than conventional electric power ...




Publication date: 2015